Skip to content

Conversation

@RiverDave
Copy link
Collaborator

@RiverDave RiverDave commented Nov 20, 2025

The tests for all kernell calls are originally located at: https://github.com/llvm/clangir/blob/main/clang/test/CodeGenCUDA/kernel-call.cu we might need to re-organize all of our already existing tests for parity and correctness in the future. If anybody's interested in what stream-per-thread actually means: https://developer.nvidia.com/blog/gpu-pro-tip-cuda-7-streams-simplify-concurrency/

@RiverDave RiverDave changed the title [CIR][CUDA] Support stream per thread [CIR][CUDA] Add Support for stream per thread Nov 20, 2025
@RiverDave RiverDave force-pushed the users/riverdave/cuda-per-thread-stream branch from 9949967 to 4069ee9 Compare November 20, 2025 23:04
@github-actions
Copy link

github-actions bot commented Nov 20, 2025

✅ With the latest revision this PR passed the C/C++ code formatter.

Copy link
Member

@bcardosolopes bcardosolopes left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM minor pending nits. (@koparasy, any feedback?)

LangOptions::GPUDefaultStreamKind::PerThread)
llvm_unreachable("NYI");
LangOptions::GPUDefaultStreamKind::PerThread) {
if (cgf.getLangOpts().HIP)
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Please add tests that also cover HIP!

@koparasy
Copy link
Contributor

LGTM, please add the HIP test as well

@RiverDave RiverDave changed the title [CIR][CUDA] Add Support for stream per thread [CIR][CUDA][HIP] Add Support for stream per thread Nov 21, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants